Schema AND Data: A Holistic Approach to Mapping, Resolution and Fusion in Information Integration

Authors

  • Laura M. Haas
  • Martin Hentschel
  • Donald Kossmann
  • Renée J. Miller
Abstract

To integrate information, data in different formats, from different, potentially overlapping sources, must be related and transformed to meet the users’ needs. Ten years ago, Clio introduced nonprocedural schema mappings to describe the relationship between data in heterogeneous schemas. This enabled powerful tools for mapping discovery and integration code generation, greatly simplifying the integration process. However, further progress is needed. We see an opportunity to raise the level of abstraction further, to encompass both data- and schema-centric integration tasks and to isolate applications from the details of how the integration is accomplished. Holistic information integration supports iteration across the various integration tasks, leveraging information about both schema and data to improve the integrated result. Integration independence allows applications to be independent of how, when, and where information integration takes place, making materialization and the timing of transformations an optimization decision that is transparent to applications. In this paper, we define these two important goals, and propose leveraging data mappings to create a framework that supports both data- and schema-level integration tasks.


Similar resources

Using Schema Mapping to Facilitate Data Integration

To integrate data from disparate, heterogeneous information sources in an open environment, data-integration systems demand a resolution of several major issues: (1) heterogeneity, (2) scalability, (3) continual infusion and change of local information sources, (4) query processing complexity, and (5) global schema evolution. Although several data-integration systems have been proposed to addre...


An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Processing (OLAP), data mining, the semantic web [2] and schema integration. The task is to find the semantic correspondences between elements of two schemas. Recently, schema matching has attracted considerable interest in both research and practice. In this paper, we present a new impr...


The effectiveness of schema therapy on rumination, cognitive fusion, cognitive avoidance and neurocognitive processing in couples applying for divorce

Introduction: Divorce is a process that begins with the emotional crisis of both couples and ends with trying to resolve the conflict through entering a new success with a new role and lifestyle. The present study aimed to examine the effectiveness of schema therapy on rumination, cognitive fusion, cognitive avoidance and neurocognitive processing in couples applying for divorce in...


i3MAGE: Incremental, Interactive, Inter-Model Mapping Generation

Doctoral thesis (Doktor der Naturwissenschaften) by Christoph Pinkel, School of Business Informatics and Mathematics. Data integration is a highly important prerequisite for most enterprise data analyses. While hard in general, a particular concern is about human effort for designing a global integration schema, authoring queries against that schema,...


Eliminating NULLs with Subsumption and Complementation

In a data integration process, an important step after schema matching and duplicate detection is data fusion. It is concerned with the combination or merging of different representations of one real-world object into a single, consistent representation. In order to solve potential data conflicts, many different conflict resolution strategies can be applied. In particular, some representations ...




Publication date: 2009